openMPI/mpich2 不在多个节点上运行

您所在的位置:网站首页 start up片段 openMPI/mpich2 不在多个节点上运行

openMPI/mpich2 不在多个节点上运行

#openMPI/mpich2 不在多个节点上运行| 来源: 网络整理| 查看: 265

我正在尝试在多节点集群上安装 openMPI 和 mpich2,但在这两种情况下我都无法在多台机器上运行。 使用 mpich2 我能够从头节点在特定主机上运行,​​但是如果我尝试从计算节点运行某些东西到不同的节点,我会得到:

HYDU_sock_connect (utils/sock/sock.c:172): unable to connect from "destination_node" to "parent_node" (No route to host) [proxy:0:0@destination_node] main (pm/pmiserv/pmip.c:189): unable to connect to server parent_node at port 56411 (check for firewalls!)

如果我尝试使用 sge 来设置作业,我会遇到类似的错误。

另一方面,如果我尝试使用 openMPI 来运行作业,我将无法在任何远程机器上运行,即使是从头节点。 我得到:

ORTE was unable to reliably start one or more daemons. This usually is caused by: * not finding the required libraries and/or binaries on one or more nodes. Please check your PATH and LD_LIBRARY_PATH settings, or configure OMPI with --enable-orterun-prefix-by-default * lack of authority to execute on one or more specified nodes. Please verify your allocation and authorities. * the inability to write startup files into /tmp (--tmpdir/orte_tmpdir_base). Please check with your sys admin to determine the correct location to use. * compilation of the orted with dynamic libraries when static are required (e.g., on Cray). Please check your configure cmd line and consider using one of the contrib/platform definitions for your system type. * an inability to create a connection back to mpirun due to a lack of common network interfaces and/or no route found between them. Please check network connectivity (including firewalls and network routing requirements).

这些机器相互连接,我可以从任何机器到任何其他机器进行 ping、无密码 ssh 等操作,MPI_LIB 和 PATH 在所有机器中都设置得很好。



【本文地址】


今日新闻


推荐新闻


CopyRight 2018-2019 办公设备维修网 版权所有 豫ICP备15022753号-3